AITopics | dynamic mismatch

Collaborating Authors

dynamic mismatch

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Cross-Domain Policy Adaptation via V alue-Guided Data Filtering

Neural Information Processing SystemsFeb-17-2026, 17:47:12 GMT

Part of this work was done during Kang Xu's internship at Shanghai Artificial Intelligence Laboratory.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.24)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

28f248e9279ac845995c4e9f8af35c2b-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-7-2026, 21:34:41 GMT

artificial intelligence, machine learning, target domain, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Robots (0.54)
Information Technology > Artificial Intelligence > Machine Learning (0.50)

Add feedback

Cross-Domain Policy Adaptation via Value-Guided Data Filtering

Neural Information Processing SystemsDec-27-2025, 02:30:23 GMT

Generalizing policies across different domains with dynamics mismatch poses a significant challenge in reinforcement learning. For example, a robot learns the policy in a simulator, but when it is deployed in the real world, the dynamics of the environment may be different. Given the source and target domain with dynamics mismatch, we consider the online dynamics adaptation problem, in which case the agent can access sufficient source domain data while online interactions with the target domain are limited. Existing research has attempted to solve the problem from the dynamics discrepancy perspective. In this work, we reveal the limitations of these methods and explore the problem from the value difference perspective via a novel insight on the value consistency across domains. Specifically, we present the Value-Guided Data Filtering (VGDF) algorithm, which selectively shares transitions from the source domain based on the proximity of paired value targets across the two domains. Empirical results on various environments with kinematic and morphology shifts demonstrate that our method achieves superior performance compared to prior approaches.

artificial intelligence, machine learning, proceedings, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.77)

Add feedback

Residual Force Control for Agile Human Behavior Imitation and Extended Motion Synthesis

Neural Information Processing SystemsDec-24-2025, 21:53:42 GMT

Reinforcement learning has shown great promise for synthesizing realistic human behaviors by learning humanoid control policies from motion capture data. However, it is still very challenging to reproduce sophisticated human skills like ballet dance, or to stably imitate long-term human behaviors with complex transitions. The main difficulty lies in the dynamics mismatch between the humanoid model and real humans. That is, motions of real humans may not be physically possible for the humanoid model. To overcome the dynamics mismatch, we propose a novel approach, residual force control (RFC), that augments a humanoid control policy by adding external residual forces into the action space.

artificial intelligence, machine learning, proceedings, (7 more...)

Neural Information Processing Systems

Genre: Research Report > Promising Solution (0.39)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.39)

Add feedback

d9e74f47610385b11e295eec4c58d473-Supplemental.pdf

Neural Information Processing SystemsNov-15-2025, 20:04:27 GMT

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.45)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Robots (0.68)

Add feedback

Residual Force Control for Agile Human Behavior Imitation and Extended Motion Synthesis

Neural Information Processing SystemsNov-15-2025, 16:43:44 GMT

Furthermore, we propose a dual-policy control framework, where a kinematic policy and an RFC-based policy work in tandem to synthesize multi-modal infinite-horizon human motions without any task guidance or user input.

artificial intelligence, deep learning, machine learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > Canada (0.04)
Asia (0.04)
Africa > Mali (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Vision (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

f76a89f0cb91bc419542ce9fa43902dc-AuthorFeedback.pdf

Neural Information Processing SystemsNov-15-2025, 16:43:31 GMT

We'd like to first thank the reviewers for their constructive feedback. Here we aim to address the main questions raised by the reviewers. RFC policy they are analogous to the goals in DeepMimic. If we don't want the agent to go beyond its ability, then RFC could be extended to a scaffolding technique Also, as shown in the video, when the agent is forced to imitate demonstrations from other agents (e.g., Finally, for agent-object interaction, the RFs won't hinder learning since the policy can always learn The RFs are only applied to stabilize the agent without changing object contact. A: Since the motion synthesis baselines are deterministic, i.e., no diversity (we Besides, the design of the cV AE itself is not the focus of the paper and can be replaced by other models.

artificial intelligence, dynamic mismatch, residual force, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.32)

Add feedback